Personalised ADPCM Speech Playback Devices for the Non-vocal and the Non-vocal Physically Handicapped

Author

  • R. B. Reilly
Abstract

Two personalised instantaneous speech playback devices were designed and developed. Both centre on the memory efficiency of Adaptive Differential Pulse Coded Modulation (ADPCM). One system is handheld and offers 64 seconds of speech. The second system is PC based and has a dictionary of 230 four-second phrases.

Medical Assessment: The problem of communication for the vocally handicapped is of grave concern. The communication aids currently available for this group include text-to-speech synthesizers and portable LPC coders (ref. 1 & 2). The use of these devices is restricted by poor speech quality and by their usual mid-Atlantic accent. Some offer good quality speech but are somewhat restrictive in use (ref. 3). The non-vocal physically handicapped also have few communication systems available to them, and the severity of their disabilities normally limits their use of such systems. A study conducted by the National Medical Rehabilitation Centre made it evident that a synthesizer system would have to preserve quality, accent and sex. The study also showed that the majority of potential users did not require full text-to-speech but rather short, well-spoken phrases. Cost also had a bearing on the communication aid prescribed: an inexpensive instrument would provide a more equitable therapy service.

Handheld Communicator: A small portable ADPCM playback device was developed. The electronics of the device are centred on the OKI Semiconductor MSM5218 CMOS speech processor (ref. 4). Using this chip, speech is automatically encoded from 12 bits to 4 bits and stored. The recording circuitry is linked to an IBM PC via an I/O interface card, so the quantity of speech that can be stored is limited only by the fixed-disk capacity of the PC. The speech is input from a microphone, filtered, digitized and compressed. The same speech processing chip is incorporated in the playback unit.
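To make the 12-bit to 4-bit encoding step concrete, the following Python sketch shows a generic adaptive-step ADPCM codec. It is not the MSM5218's actual algorithm: the step-size table (`STEPS`), the adaptation rule (`INDEX_ADJUST`) and all function names are assumptions chosen for illustration only.

```python
# Illustrative 4-bit ADPCM codec. NOT the MSM5218's exact algorithm:
# the step table and adaptation rule below are assumed for this sketch.
STEPS = [int(16 * 1.1 ** i) for i in range(49)]   # assumed step sizes
INDEX_ADJUST = [-1, -1, -1, -1, 2, 4, 6, 8]       # assumed adaptation rule


def _clamp(value, lo, hi):
    return max(lo, min(hi, value))


def _reconstruct(code, step):
    """Signed difference represented by a 4-bit code at a given step size."""
    diff = step >> 3
    if code & 4:
        diff += step
    if code & 2:
        diff += step >> 1
    if code & 1:
        diff += step >> 2
    return -diff if code & 8 else diff


def _update(predicted, index, code):
    """Advance the predictor state shared by encoder and decoder."""
    predicted = _clamp(predicted + _reconstruct(code, STEPS[index]), -2048, 2047)
    index = _clamp(index + INDEX_ADJUST[code & 7], 0, len(STEPS) - 1)
    return predicted, index


def encode(samples):
    """Encode signed 12-bit samples as a list of 4-bit codes (0..15)."""
    predicted, index, codes = 0, 0, []
    for sample in samples:
        diff = sample - predicted
        code = 8 if diff < 0 else 0                      # bit 3 is the sign
        code |= min(7, (abs(diff) * 4) // STEPS[index])  # 3-bit magnitude
        predicted, index = _update(predicted, index, code)
        codes.append(code)
    return codes


def decode(codes):
    """Expand 4-bit codes back to signed 12-bit samples."""
    predicted, index, out = 0, 0, []
    for code in codes:
        predicted, index = _update(predicted, index, code)
        out.append(predicted)
    return out
```

Each sample is stored as one 4-bit code instead of a 12-bit value, so the stored speech occupies one third of the bits of the uncompressed samples. The decoder repeats the encoder's predictor updates exactly, which is why only the codes need to be stored.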
The handheld unit also contains a small single-chip CMOS microcontroller, the NEC uPD78C10. A 4 x 4 keyboard gives access to 16 four-second phrases (memory of 2M bytes). The controller scans the keyboard and selects the memory locations required. The nibble at each memory address contains one sample of speech data. The data is expanded and output via a filter and an audio amplifier. Using a keyboard can be awkward for people with cerebral palsy, so an external switch selection mechanism has been provided for such users. The controller scans each of the four columns in sequence, illuminating small LEDs as it does so. A positive action on the switch causes the corresponding column to be selected. The controller then scans each row, and a second positive action on the switch selects an individual key. The switch can be a chin-bar, a suck-or-blow switch or an ultrasonic proximity switch mounted on a wheelchair or bedframe. Keypad icons remind the user of the phrase uttered at each key location, and different icons can be inserted behind each key as different phrases are recorded. The length of individual phrases can be altered, allowing variable-length messages according to the needs of the user. The 16 key areas can also be regrouped to form 4 or only 2 areas, which allows those with restricted dexterity to access an area rather than a single key location.

Author affiliation: Dept of Electronic and Electrical Engineering, University College Dublin, Belfield, Dublin 4, Ireland.

PC Based Communicator: As the handheld communicator was designed for those who wish to use a limited number of phrases in everyday situations, a PC based aid was developed for those who wish to use a more sophisticated communicator. The system was designed with non-vocal quadriplegics in mind. The communicator uses two switches, which are continually monitored.
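The column-then-row scanning used with the handheld unit's external switch can be sketched as a small state machine. This is a minimal Python illustration, not the uPD78C10 firmware: the function name `scan_select`, the representation of the switch as one boolean per scan step, and the wrap-around behaviour when no action occurs are all assumptions.

```python
def scan_select(switch_states, size=4):
    """Select a key on a size x size keypad with a single switch.

    switch_states: iterable of booleans, one per scan step, where True
    means a positive action on the switch during that step (assumed model).
    Returns (row, column), or None if the input ends before a key is chosen.
    """
    column = None
    position = 0                      # column or row currently highlighted
    for pressed in switch_states:
        if pressed and column is None:
            column = position         # first action fixes the column
            position = 0              # row scan then starts from the top
        elif pressed:
            return (position, column) # second action fixes the row
        else:
            position = (position + 1) % size  # move on, wrapping around
    return None


# The highlight passes columns 0 and 1, the user acts on column 2,
# the highlight passes row 0, and the user acts on row 1.
key = scan_select([False, False, True, False, True])
```

With a 4 x 4 keypad the selected pair maps to a phrase number such as `row * 4 + column`; that numbering is also an assumption of this sketch.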
The screen initially displays the first letter of the alphabet, at four times normal size, in the top left-hand corner. One switch scrolls through the alphabet and the second switch selects a letter. Once a letter is chosen, a dictionary of words starting with that letter is displayed on the screen. The dictionary contains the words most widely used by the patient. The patient can scroll through the dictionary using the first switch and select a word using the second. The words in the dictionary are accompanied by ADPCM encoded versions of those words. Once a sentence has been built, selecting Speak from any dictionary window automatically outputs the required speech data, stored in RAM, to circuitry containing the MSM5218 speech processor, an external audio amplifier and a loudspeaker. The full sentence is then spoken. Approximately 230 words are presently stored in the dictionary. This number can be increased, although it was found that no more than 200 are regularly used.

Use of Communication Aids: The supervision of these communication aids is the responsibility of the speech therapists at the medical location where they are being used. The choice of phrases for the handheld playback unit is directed by the therapist. The voice chosen is normally that of an individual similar in sex, dialect and age. The recording of volunteers' voices is carried out by the therapist. The encoded phrases can be replayed and recorded easily before they are downloaded to the playback unit or the dictionary based communicator. The storage within the handheld unit currently consists of EPROM; however, a RAM version is being developed. For use of the PC based communicator with quadriplegics, the type and location of the switches are vital. A small training routine is provided to aid in the alignment of the switches, and it simultaneously instructs the user on the action of each switch. The use of speech with this communicator has made people in the vicinity of the patient more attentive than when a screen text writer is used.
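The two-switch selection scheme of the PC based communicator can be sketched as follows. This is a hedged Python illustration rather than the authors' software: the event names `"scroll"` and `"select"`, the function `run_communicator`, the return to the alphabet after each word, and the placement of Speak as the last entry of every dictionary window are assumptions.

```python
import string


def run_communicator(events, dictionary):
    """Simulate two-switch sentence building.

    events: iterable of "scroll" (first switch) or "select" (second switch).
    dictionary: maps an upper-case letter to the user's words for that letter.
    Returns the list of sentences passed to the (hypothetical) speech output.
    """
    letters = string.ascii_uppercase
    mode, position = "letters", 0
    entries, sentence, spoken = [], [], []
    for event in events:
        options = letters if mode == "letters" else entries
        if event == "scroll":
            position = (position + 1) % len(options)
        elif event == "select" and mode == "letters":
            # Open the dictionary window for the chosen letter.
            entries = list(dictionary.get(letters[position], [])) + ["Speak"]
            mode, position = "words", 0
        elif event == "select":
            choice = entries[position]
            if choice == "Speak":
                spoken.append(sentence)   # here the stored ADPCM data would play
                sentence = []
            else:
                sentence.append(choice)
            mode, position = "letters", 0  # assumed: return to the alphabet
    return spoken


words = {"A": ["am"], "I": ["I"], "T": ["thirsty"]}
events = (["scroll"] * 8 + ["select", "select"]      # letter I, word "I"
          + ["select", "select"]                     # letter A, word "am"
          + ["scroll"] * 19 + ["select", "select"]   # letter T, word "thirsty"
          + ["select", "scroll", "select"])          # letter A, then Speak
result = run_communicator(events, words)
```

Scrolling wraps around in both the alphabet and the dictionary window, so every entry stays reachable with only two switches.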

Publication date: 2004